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Abstract 

We consider the problem of constructing optimal decision trees: given a collection of tests which 
can disambiguate between a set of m possible diseases, each test having a cost, and the a-priori 
likelihood of the patient having any particular disease, what is a good adaptive strategy to perform 
these tests to minimize the expected cost to identify the disease ? We settle the approximability of this 
problem by giving a tight 0{\og m)-approximation algorithm. The optimal decision tree problem was 
known to be n(logm)-hard to approximate, and previously (9 (log m) -approximations were known 
only under either uniform costs or uniform probabilities. 

We also consider a more substantial generalization, the Adaptive TSP problem. Given an underlying 
metric space, a random subset S of cities is drawn from a known distribution, but S is initially 
unknown to us — we get information about whether any city is in S only when we visit the city 
in question. What is a good adaptive way of visiting all the cities in the random subset S while 
minimizing the expected distance traveled? For this adaptive TSP problem, we give the first poly- 
logarithmic approximation, and show that this algorithm is best possible unless we can improve 
the approximation guarantees for the well-known group Steiner tree problem. We also give an 
approximation algorithm with the same guarantee for the adaptive traveling repairman problem. 

1 Introduction 

Consider the following two adaptive covering optimization problems: 

• Adaptive TSP under stochastic demands. (AdapTSP) A traveling salesperson is given a metric 
space iy, d), distinct subsets Si , ^2, . . . , C V such that Si appears with probability pi (and pi = 
1), and she needs to serve requests at this subset of locations. However, she does not know the 
identity of the random subset: she can only visit locations, at which time she finds out whether 
or not that location is part of the subset. What adaptive strategy should she use to minimize the 
expected time to serve all requests? 

• Optimal Decision Trees. Given a set of m diseases, there are n binary tests that can be used to 
disambiguate between these diseases. If the cost of performing test t G [n] is c,, and we are given 
the likelihoods {pj} je\m] that a typical patient has the disease j, what (adaptive) strategy should 
the doctor use for the tests to minimize the expected cost to identify the disease? 
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It can be shown that the optimal decision tree problem is a special case of the former problem (a 
formal reduction is given in Section [S]) In both these problems we want to devise adaptive strategies, 
which take into account the revealed information in the queries so far (e.g., cities already visited, or 
tests already done) to determine the future course of action — i.e., our output must be a decision tree. In 
contrast, a non-adaptive strategy con^esponds to a decision list. While some problems have the property 
that the best adaptive and non-adaptive strategies have similar performances, both the above problems 
have a large adaptivity gap ['lOj (the worst ratio between the performance of the optimal non-adaptive 
and optimal adaptive strategies). Hence it is essential that we seek good adaptive strategies. 

The optimal decision tree problem has long been studied (its NP-hardness was shown by Hyafil and 
Rivest in 1976 |[2TI . and many references and applications can be found in |[29l ). On the algorithms 
side, C? (log m) -approximation algorithms have been given for the special cases where the likelihoods 
{pj} are all equal, or where the test costs {cj are all equal |[T3l |26l HI |9l d H |29l [HI. But the 
problem has remained open for the case when both the probabilities and the costs ai^e ai^bitrary — the 
previous results only imply O ^log^^^ and O ^log(m^f^)^ approximation algorithms for the general 

case. An 0(logm)-approximation for general costs and probabilities would be the best possible unless 
NP C Dr/M£'[?i'^('°§'°s")] |4]; and while the existence of such an algorithm has been posed as an open 
question, it has not been answered prior to this work. 

In this paper, we settle the approximability of the optimal decision tree problem: 

Theorem 1 There is an 0{logm)-approximation for the optimal decision tree problem with arbitrary 
test costs and arbitrary probabilities; the problem admits the same approximation ratio even when the 
tests have non-binary outcomes. 

In fact, this result arises as a special case of the following theorem: 

Theorem 2 There is an 0{log^ nlogm)-factor approximation algorithm for the adaptive TSP problem. 

In this paper, we first prove the latter result for adaptive TSP, and then show how that algorithm special- 
ized to the optimal decision tree problem actually gives us the claimed 0(logm)-factor approximation. 

A Word about our Techniques. To solve the AdapTSP problem, we first solve the "Isolation" problem, 
which seeks to identify which of the m scenarios has materialized; once we know the scenario we can 
visit its vertices using, say, Christofides' heuristic. The idea behind our algorithm for Isolation is this — 
suppose each vertex lies in at most half the scenarios; if we visit one vertex in each of the m scenarios 
using a short tour (which is just group Steiner tree HH), we'd notice at least one of these vertices to 
have a demand — this would reduce the number of possible scenarios by 50%. This is necessarily an 
over-simplified view, and there are many details to handle: we need not visit all scenarios — visiting all 
but one allows us to infer the last one by exclusion; the expectation in the objective function means we 
need to solve a min-latency version of group Steiner tree; not all vertices need lie in less than half the 
scenarios. Another major issue is that moreover we do not want our performance to depend on the size 
of the probabilities, in case some of them are exponentially small, so we cannot just hope to reduce the 
measure of the remaining scenarios by 50%. Finally, we need to charge our cost directly against the 
optimal decision tree. All these issues can be resolved to give an 0{log^n ■ logm) approximation for 
Isolation (and hence for AdapTSP) with n nodes and m scenarios. The Isolation algorithm also involves 
an interesting combination of ideas from the group Steiner |[T4l Q and minimum latency ||2j |6j [TTl 
problems — it uses a greedy strategy that is greedy with respect to two different criteria, namely both the 
probability measure and the number of scenarios. This idea is formalized in our definition of the partial 
latency group Steiner (LPGST) problem, and its algorithm (see Section |4l). In Appendix lAl we show 
that both AdapTSP and Isolation are Q,{log^^^ n) hard to approximate even on tree metrics; our results 
are essentially best possible on such metrics, and we lose an extra logarithmic factor to go to general 
metrics, as in the group Steiner tree problem. 
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For the special case of the optimal decision tree problem, we show that we can use the min-sum set 
cover problem flT] instead of the min-latency version of group Steiner tree; this avoids an 0(log^?i) loss 
in the approximation guarantee, and hence gives us an optimal C?(logm)-approximation for the optimal 
decision tree problem. Our result further reinforces the close connection between the min-sum set cover 
problem and the optimal decision tree problem that was first noticed by Chakravarthy et al. lH. 

Finally, we consider the adaptive traveling repairman problem (AdapTRP), which has the same 
input as AdapTSP, but we want to minimize the expected average response time of the vertices in the 
materialized demand set. Note that we cannot first isolate the scenario and then visit all its nodes, since 
a long isolation tour may negatively impact the response times. However, we show that our techniques 
for AdapTSP are robust, and can be used to get a similar approximation algorithm for the AdapTRP. 

A brief outline of the paper layout: the results on AdapTSP and Isolation appear in Section |3l the 
algorithm for partial latency group Steiner appears in Section |4l and the hardness result in Appendix lAl 
We show how to use the AdapTSP and Isolation results to get improved algorithms for optimal decision 
tree in Section |5l The details for the adaptive traveling repairman problem are in Section |6l 

1.1 Other Related Work 

The optimal decision tree problem has been studied earlier by many authors, with algorithms and hard- 
ness results being shown by |[T3l llTl l26l l24l [T] |9j SI 121 HH ; as mentioned above, the algorithms in 
these papers give O (log m) -approximations only when either the probabilities or costs (or both) are 
polynomially-bounded. Table 1 in Guillory and Bilmes |[T8l gives a good summary of known approxi- 
mation ratios; in general, the best approximation guarantees are O ( log ) and O ( log(m ) . The 

\ "min / \ / 

C?(logm)-approximation for arbitrary costs and probabilities solves an open problem from these papers. 
The AdapTSP problem has not been studied earlier, to the best of our knowledge. 

There ai^e many results on adaptive optimization. E.g., |[T6l considered an adaptive set-cover prob- 
lem; they give an O (log «) -approximation when sets may be chosen multiple times, and an 0{n)- 
approximation when each set may be chosen at most once. This was improved in |[27| to C?(log^ n logm), 
and subsequently to the best-possible 0(log?i)-approximation by ll25l . also using a greedy algorithm. fTl 
considered adaptive strategies for stochastic routing problems using a greedy-like algorithm where they 
solved an LP at each step. ifTTl studied adaptive transmission in noisy channels (that have multiple 
states) to maximize the utility (i.e., the difference between success-probability and probing-cost). In all 
these problems, the adaptivity-gap is large, as is the case for the problems we considered here, and hence 
the solutions need to be inherently adaptive. 

The AdapTSP problem is related to universal TSP |l23l[T3l and a priori TSP |l22l[30l[3ll only in 
spirit — in both the universal and a priori TSP problems, we seek a master tour which we shortcut once 
the demand set is known, and the goal is to minimize the worst-case or expected length of the shortcut 
tour. The crucial difference is that the demand subset is revealed in toto in these two problems, leaving 
no possibility of adaptivity — this is in contrast to the slow revelation of the demand subset that occurs 
in AdapTSP. 

2 Notation 

Throughout this paper, we deal with demand distributions over vertex-subsets that are specified explic- 
itly. I.e. demand distribution & is specified by m distinct subsets {5, C yjJ'Lj having associated proba- 
bilities {/J/j-lj such that Y!iL\ Pi = ^- This means that the realized subset D C V of demand- vertices will 
always be one of where D = Si with probability pi (for all / G [m]). We also refer to the subsets 

{'^iViLi scenarios. The following definition captures all adaptive strategies. 
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Definition 1 (Decision Tree) A decision tree T in metric iy^d) is a rooted binary tree where each non- 
leaf node ofT is labeled with a vertex u &V, and its two children Uy^s and m„o correspond to the subtrees 
taken if there is demand at u or if there is no demand at u. Thus given any realized demand D (^V, a 
unique path Po is followed in T from the root down to a leaf. 

Depending on the problem under consideration, there are additional constraints on decision tree T and 
the expected cost of T is also suitably defined. There is a (problem-specific) cost Q associated with 
each scenario / G [m] that depends on path Ps-, and the expected cost of T (under distribution S)) is then 
YdLi Pi ■ Ci. For example in AdapTSP, cost C, corresponds to the length of path Ps-. 
Given a metric iy,d) and root r £ V, an r-tour is any tour that begins and ends at r. 

3 The Adaptive TSP and Isolation Problems 

The input to adaptive TSP is a metric {V, d) with root r G V and a demand distribution over subsets 
of vertices. The crucial aspect in this model is that information on whether or not there is demand at a 
vertex v is obtained only when that vertex v is visited. The objective is to find an adaptive strategy that 
minimizes the expected time to visit all vertices of the realized scenario drawn from 

We assume that the distribution ^ is specified explicitly with a support-size of m. The more general 
setting would be to consider black-box access to distribution however, as shown in 1281 . AdapTSP 
under general black-box distributions admits no o{n) approximation if the algorithm is restricted to 
polynomially many samples. This is the reason we consider an explicit demands model for Moreover, 
AdapTSP under explicit demands still contains interesting special cases such as optimal decision tree 
problem. One could also consider AdapTSP under independent demand distributions; note that in this 
case there is a trivial solution: visit all vertices having non-zero probability along an approximately 
minimum TSP tour. In terms of Definition[T] we have: 

Definition 2 (The AdapTSP Problem) Given metric {V, d), root r and demand distribution Si, the goal 
in AdapTSP is to compute a decision tree T in metric with the added conditions that: 

• the root of T is labeled with the root vertex r, and 

• for each scenario i £ [m], the path Pg. followed on input Si £ support{&) contains all vertices in 
Si. 

The objective function is to minimize the expected tour length YliL\ Pi ' ^{PSi)' where d{Psj) is the length 
of the tour that starts at r, visits the vertices on path P^. in that order, and returns to r. 

A closely related problem is the Isolation problem. This has the same input as the AdapTSP prob- 
lem, but the goal is not necessarily to visit aU the vertices in the realized scenario, but just to identify the 
unique scenario that has materialized. 

Definition 3 (The Isolation Problem) Given metric {y,d), root r and demand distribution Si, the goal 
in Isolation is to compute a decision tree T in metric iy,d) with the added conditions that: 

• the root of T is labeled with the root vertex r, and 

• for each scenario i £ [m], the path Ps^ followed on input Si ends at a distinct leaf-node of T. 
The objective function is to minimize the expected tour length IsoTime(r) := YULiPi ■'^(^5,). where 
d{Ps^) is the length of the tour that starts at r, visits the vertices on path Ps^ in that order, and returns to 
r. 

Note the only difference between this definition and the one for AdapTSP is that the tree path Ps- 
need not contain all vertices of Si, and the paths for different scenarios should end at distinct leaf-nodes. 
We first prove the following simple fact relating these two objectives. 

Lemma 3 An a-approximation algorithm for Isolation implies an (a + |) approximation for AdapTSP. 
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Proof: We first claim tliat any feasible solution T to AdapTSP is also feasible for Isolation. For this it 
suffices to show that P5. / P5. for any two scenarios /, j G [m] with / / j. Suppose (for a contradiction) 
that paths P5. = = 71 for some / 7^ j. By feasibility of T for AdapTSP, path n contains all vertices in 
SiljSj. Since 5,- 7^ 5j, there is some vertex in {Si\Sj) \J{Sj\Si); let u G Si\Sj (the other case is identical). 
Consider the point where 71 is at a node labeled u: then path Pg- must take the yes child, whereas path P5 
must take the no child. This contradicts the assumption P5. = Psj = n. Thus any solution to AdapTSP 
is also feasible for Isolation; moreover the expected cost remains the same. Hence the optimal value of 
Isolation is at most that of AdapTSP. 

Now, using an a-approximation for Isolation, we obtain a strategy T' that isolates the realized sce- 
nario and has expected cost a • Opt, where Opt denotes the optimal value of the AdapTSP instance. 
This suggests the following feasible strategy for AdapTSP: 

1. Run T' to determine the realized scenario k € [m], and return to r. 

2. Traverse a |-approximate TSP tour jSl on vertices {r}\JSk- 

From the preceding argument, the expected length in the first phase is at most a • Opt. The expected 
length in the second phase is at most | ^"^j pi ■ Tsp(5, ), where Tsp(5,) denotes the minimum length of 
a TSP tour on {r} |J5,-. Note that ^"^j pi • Tsp(5',) is a lower bound on the optimal AdapTSP value. 
Hence we obtain a strategy that has expected cost at most (a + |)Opt, as claimed. ■ 
In the next few subsections, we give an 0{log^n logm)-approximation algorithm for Isolation, which 
by this lemma implies the same result for AdapTSP. In Appendix lAl we complement this result by 
showing that AdapTSP is at least as hard to approximate as the well-known Group Steiner Tree problem, 
for which the best approximation ratio is 0{log'n logm). 

3.1 Approximation Algorithm for the Isolation Problem 

Recall that an instance of Isolation is specified by a metric {V,d), a root vertex r G V, and m scenarios 
{Si}1Li with associated probability values {piYlL^. At a high level, our approach is a simple divide- 
and-conquer based one. The algorithm for Isolation is recursive: given an instance, it first develops a 
strategy to generate several sub-instances from this instance, each given by some proper subset M C [m] 
of scenarios, with associated probabilities {qi}ieM where Y^ieM^i = 1- (We refer to such a sub-instance 
of the original Isolation instance as (M, {qi}ieM)-) We then recursively build an isolation strategy within 
each sub-instance. The real question is: How do we generate these sub-instances so that we can charge 
these to the cost of the best decision tree? 

A useful subroutine to address this and solve Isolation will be the partial latency group Steiner 
(LPGST) problem, where we are given a metric g groups of vertices {X, C V}f^i with associated 

weights {w,}f^[, root r G V, and a target h < g. A group / G [g] is said to be covered (or visited) by 
r-tour T if any vertex in X, is visited, and the arrival time of group / is the length of the shortest prefix of 
T that contains an X;-vertex. The arrival times of all uncovered groups are set to be the tour-length. The 
weighted sum of anival times of all groups is termed latency of the tour, i.e., 

latency (t) = I,- covered Wi ■ arrival time^iXj) + Y,i uncovered Wi ■ length(T). (3.1) 

Note that this objective function generalizes the standard min-latency objective, where we have to cover 
all groups. In Section|4]we prove the following result: 

Theorem 4 There is an {0(log'^n),4)-bicriteria approximation algorithm for LPGST. I.e., the tour 
output by the algorithm visits at least | groups and has latency at most 0{\o^n) times the optimal 
latency of a tour that visits h groups. 

In this subsection, we present the algorithm for Isolation assuming this result for LPGST. Let us 
begin by showing the partitioning algorithm. 
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The Partitioning Algorithm The algorithm Partition (given below as Algorithm [T]) finds an r-tour T 
in the metric such that after observing the demands on r, the number of scenarios that are consistent with 
these observations is only a constant fraction of the total. E.g., if there was a vertex v such that w 50% 
of the scenarios contained it, then visiting vertex v would reduce the candidate scenarios by « 50%, 
irrespective of the observation at v, giving us a tangible notion of progress. However, each vertex may 
give a very unbalanced partition or such a vertex may be too expensive to visit, so we may have to visit 
multiple vertices — this is the basic idea in algorithm Partition. 



Algorithm 1 Algorithm Partition( (M,{^,};£m) ) 



1: letg = \M\. For each v G V, define F,, := {/ g M | v G 5,}, and D, | J^^^^ ![ ^ | 

2: for each / G M, set X,- {v G V | / G D,,}. Note that either the presence of a demand at some vertex v 
reduces the number of still-possible scenarios by half, or the absence of demand does so. To handle 
this asymmetry, this step takes scenarios and "flips" some vertices to get X,. 

3: run the LPGST algorithm (Theorem|4l) with metric {y,d) with root r, groups {X,},gM with weights 
{qi]ieM, and target |M| - 1. 

let T := r, vi , V2 , • • • , Vf_ i , r be the r-tour returned. 

4: let {PJUl be the partition of M where := | ^'^ y ifk^t~ 



5: return tour z and the partition {PkYk= 



The Isolation Algorithm To follow a divide-and-conquer strategy, the final algorithm is fairly natural 
given the partitioning scheme. The algorithm IsoAlg (given as Algorithm |2l) proceeds in several phases. 
In each phase, it maintains a candidate set M of scenarios such that the realized scenario lies in M. 
Upon observing demands along the tour produced by algorithm Partition (in StepO, a new set M' (IM 
containing the realized scenario is identified such that the number of candidate scenarios reduces by a 
constant factor (i.e. \M'\ < | • |M|); then IsoAlg recurses on scenarios M'. After C?(logwi) such phases 
the realized scenario would be correctly identified. 

Algorithm 2 Algorithm lsoAlg(M, {qi}ieM) 
1: If |M| = 1, return this unique scenario as realized. 
2: run Partition (M,{^,},eA/) 

let T = (r, vi , V2 , • • • , Vf_ 1 , r) be the r-tour and {Pk}{^i be the partition of M returned. 
3: let q'j := J^iePk 1i for all j G 1 . . . 

4: traverse tour t and return directly to r after visiting the first (if any) vertex v^.* (for ^* G [? — 1]) that 
determines that the realized scenario is in P^* C M. If there is no such vertex until the end of the 
tour T, then set k* -^t. 

5: run IsoAlg (Pjt*,{-^};gp^,) to isolate the realized scenario within the subset Pk*. 



Note that the adaptive Algorithm IsoAlg implicitly defines a decision tree too: indeed, we create 
a path (r,vi,V2,--- ,V/_i,v, = r), and hang the subtrees created in the recursive call on each instance 
(/\, {|f }) from the respective node Vk- 

Remark on possible simplification. We use the LPGST problem as a subroutine in solving Isolation. 
The reason behind this is that we measure progress in the recursive scheme IsoAlg as the number of 
candidate scenarios, whereas in computing the objective we need to use the probabilities of scenarios. A 
seemingly simpler approach is to use in place of LPGST, a 'density' Group Steiner Tree problem, with 
profits being either (A) uniform (corresponding to measuring number of scenarios) or (B) the scenario 
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probabilities. An 0(log^ ?i)-approximation for such a density Group Steiner problem was achieved in 151, 
so this could be used directly instead of the LPGST result in Section ID However there are non-trivial 
issues with these potential simplifications- 

A. If we were to use unit-profits, then it is no longer possible to charge the length of the solution 
(which covers at least constant fraction of the groups) to the optimal adaptive solution. This is 
because the expected cost of the optimal solution is weighted by the scenarios' probabilities (see 
Claim IS. 

B. If we were to use probabilities as profits, then we could charge the solution (that covers constant 
probability of groups) to the optimal adaptive solution. But in this case, the progress in a single 
phase would only reduce the probability mass on candidate scenarios by a constant factor; so the 
number of phases can only be bounded by C?(log y—) where p„,,„ is the minimum probability of 
any scenario. This would still work if we could preprocess the instance to ensure poly-bounded 
probabilities. However, as shown by the following instance, this preprocessing appears quite 
non-trivial even in the special case of the optimal decision tree problem. There is a stai^-metric on 
vertices V = {r} U{V(}"=/ centered at r, and d{r,Vi) = 2' (for all / G [n — 1]). There are n scenarios: 
Sj = {v,} for / G [n — 1] and Sn = 0, with probabilities p,- = i (for / G [n — 1]) and p„ = 

This is also the reason why our final algorithm interleaves both greedy objectives: number and proba- 
bilities of scenarios. See Section|4]for more details. 

3.2 The Analysis for Algorithm IsoAlg 

We now prove the performance guarantee and correctness of Algorithm IsoAlg. By the construction in 
algorithm IsoAlg, it follows that the realized scenario is identified after O(logm) recursive calls to IsoAlg. 
To complete the analysis, we need two additional ideas: (1) relating the LPGST and Isolation objective- 
values in each call to IsoAlg (which bounds the cost in a single sub-instance), and (2) establishing a 
subadditivity property of Isolation, that bounds the expected cost across all 'parallel' sub-instances (i.e. 
those in the same phase). 

For any instance ^ of the isolation problem, let lsoTime*(^) denote its optimal value. 

Claim 5 For any instance ^ = {M,{qi}i^M), the optimal value of the LPGST instance considered in 
Step\3}of Algorithm Partition(^) is at most lsoTime*(^) — i.e., the LPGST instance costs at most the 
isolation instance. 

Proof: Let T be an optimal decision tree corresponding to Isolation instance and hence IsoTime* (^) = 
IsoTime(r). Note that by definition of the sets {Fy,Dy}^.^v, any internal node in T labeled vertex v has 
its two children Vyes and v„o corresponding to the realized scenario being in either or M \ (in that 
order), or equivalently, Dy or M \ Dy (now not necessarily in that order). 

Consider an r-tour a that starts at the root r of T and moves from each node v to its unique child that 
corresponds to M\Dy, until it reaches a leaf-node /, when it returns to r. Let the vertices in this tour a 
be (where uj is the last vertex visited in T before leaf /). Since T is a feasible decision 

tree for the isolation instance, the leaf / is labeled by a unique scenario a G M such that when T is run 
under demands Sa, it traces path from the root to leaf node /. Moreover, every scenario b G M\{a} gives 
rise to a root-leaf path that diverges from the root-/ path (i.e. a). From the way we constructed a, the 
scenarios that diverged from a were precisely U;(^jD„^ , and hence U;(^jD„^ =M\ {a}. 

Now consider the r-tour a as a potential solution to the LPGST instance in Step [3l Since | U^^j 
D„jJ = \M\ — 1, the definition of the X,'s says that this path hits \M\ — 1 of these sets X,-, so it is indeed 
feasible. Also, the definition of the isolation cost implies that the latency of this tour a is at most the 
isolation cost IsoTime(r) = lsoTime*(^). ■ 

If we use a (pLPGST)4)-bicriteria LPGST algorithm, we get the following claim: 
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Claim 6 The latency of tour x returned by Algorithm Partition /5afmosfpLPGST"lsoTime*((M,{^,},gM)) 
Furthermore, the resulting partition {PkYk=\ each |/\| < ||M| /or each k G [t], when \M\ > 2. 

ProofrBy Claim|5] the optimal value of the LPGST instance in Step[3]of algorithm Partition is at most 
lsoTime*((M,{^,},gM)); now the approximation guarantee from Theorem |4] implies that the latency of 
the solution tour T is at most Plpgst times that. This proves the first part of the claim. 

Consider T := (r = vo,vi , • • • ,Vt-i,Vf = r) the tour returned by the LPGST algorithm in Step Q of 
algorithm Partition; and {PkYk=\ ^^'^ resulting partition. Claim|5]and Theorem |4] imply that the number 
of groups covered by T is | \J^^Dy^\ > ^^^^^ > ^ (when \M\ > 2). By definition of the sets D,,, it holds 
that |D,,| < \M\/2 for all v G V; moreover, since all but the last part Pt is a subset of some Dy, it holds 
that |P;t| < ^ for 1 < yt < f - 1. Moreover, the set P, has size \Pt\ = \M\ (U^<fD,.)| < ||M|. ■ 

Of course, we don't really care about the latency of the tour per se, we care about the cost incun^ed 
in isolating the realized scenario. But the two are related (by their very construction), as the following 
claim formalizes: 

Claim 7 At the end of Step^of \soA\g{M ,{qi}i^M)> the realized scenario lies in Pk*. The expected 
distance traversed in this step is at most 2pLPGST " IsoTime* ((M, {qi\ieM))- 

Proof: Consider the tour T := (r = vq, vi , • • • , v,_i,V; = r) returned by the Partition algorithm. Recall 
that visiting any vertex v reveals whether the scenario lies in D,,, or in M \D,,. In step dH) of algorithm 
IsoAlg, we traverse T and one of the following happens: 

• l<^*<f— 1. Tour returns directly to r from the first vertex Vk (for 1 <k <t — \) such that the 
realized scenario lies in . Note that since the scenario did not lie in any earlier D,. for j < k, 
the definition of P^ = Dv^, \ (Uy<|Dv.) gives us that the realized scenario is indeed in Pjt. 

• k* =t. Tour T is completely traversed and we return to r. In this case, the realized scenario does 
not lie in any of {D,^ | I < k < t — I}, and it is inferred to be in M \ (Uy<fDvJ, which is Pf by 
definition. 

Hence for k* as defined in Step|4]of lsoAlg(M, {qi}ieM), it follows that P^* contains the realized scenario; 
this proves the first part of the claim (and correctness of the algorithm). 

For each / G M, let a,- denote the arrival time for group X; in tour t; recall that this is the length of 
the shortest prefix of T until it hits an X,-vertex, and is set to the entire tour length if T does not ever hit 
Xi. The construction of partition {Pk}[^[ from T implies that 

= '^;); ^' ^ yi <k<t, 

and hence latency(T) = Y^ieM^li ' Gd- 

To bound the expected distance traversed, note the probability that the traversal returns to r from 
vertex Vk (for 1 <k<t — l)is exactly LiePj. with the remaining l^,gp, qi probability the entire tour T 
is traversed. Hence the expected length traversed is at most: 

2 • lUi LiePt Qi-{l.)=id{vj-uVj)) = 2- ZieM^i •«,• = 2 • latency (t), 

and by Claim[6l this is at most 2 • Plpgst • lsoTime*((M, {qi}ieM))- B 
Now, the following simple claim captures the "sub-additivity" of IsoTime*. 

Claim 8 For any instance {M,{qj}i^M) cind any partition {PkW^i ofM, 

LUi^r IsoTime* ((P,,{|},ep,)) < IsoTime* ((M,te},eM», (3.2) 
where cfk — T,iePk ^' all I < k < t. 
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Proof: Let T denote the optimal decision tree for the instance := {M ,{qf\i^M) ■ For each ^ G [t], 
consider instance '■= {Pk,{^}iePk)'^ one feasible adaptive strategy for instance is obtained by 
taking the decision tree T and considering only paths to the leaf-nodes labeled by {/ G Pk\. Note that 
this is a feasible solution since T isolates all scenarios Uj,^j/\. Moreover, the expected cost of such a 
strategy for is Y,iePk Y ' '^(^'') where 71,- denotes the tour traced by T under scenario / G Hence 
Opt(^yt) < Y.iePk Y ' ^(.^i)- Summing over all parts k £ [t], we get 

£ g', ■ Opt(/,) < t ^ • diTii) = I d{Kd = Opt(^o), (3.3) 

k=l k=l iePk Ik ieM 

where the penultimate equality uses the fact that {PkYk=\ ^ partition of M. ■ 
Given the above claims, we finally bound the expected cost of the strategy given by our algorithm. 

Theorem 9 The expected length of the strategy given by lsoAlg(M,{g,},gM) at most: 

2pLPGST -logg/v 1^1 • lsoTime*((M,{^;};eM))- 

Proof: We prove this by induction on \M\. The base case of \M\ = 1 is trivial, since zero length is 
traversed, and hence we consider \M\ > 2. Let instance J^o := {M,{qi}ieM)- For k G [t], consider the 
sub-instance J^/t := (^jt, {|f-}rePj), where q[ = Y^iePt^i- '^^e inductive hypothesis, for any k G [t], the 
expected length of lsoAlg(j^;t) is at most 2plpgst • logg/? \Pk\ ■ lsoTime*(j^<.) < 2plpgst • (logg/? \M\ - 
1) • lsoTime*(^i.)' since \Pk\ < ||M| (from Claim[6]as \M\ > 2). 

By Claim|7J the expected length traversed in Step|4]of IsoAlg(j^o) is atmost2pLPGST • lsoTime*(J^o)- 
The probability of recursing on J^^ is exactly q'j^ for each G [t], hence the expected length of IsoAlg(j^o) 
is at most: 

2pLPGST • lsoTime*(^o) + ^'k ' (e^P- length of \soMg{j^k)) 

< 2pLPGST • IsoTime* (^o) + lUi ^k ' 2pLPGST • (logg/y |M| - 1) • IsoTime* (J^t) 

< 2pLPGST • IsoTime* (^o) + 2Plpgst • (logs/? |M| - 1) • IsoTime* ( J^o) 
= 2pLPGST-log8/7|M|-lsoTime*(j^o) 

where the third inequality uses Claim [S] ■ 
Applying Theorem |9] on the original Isolation instance gives an 0(Plpgst • logm)-approximately 
optimal algorithm; using Theorem |4] implies Plpgst = 0(log^?i) and the following: 

Theorem 10 There is an 0(log^ n ■ logm) approximation for Isolation with n vertices and m scenarios. 
Hence, there is an 0(log^n • logm) approximation for AdapTSP. 

A comment about the optimality of these results: we show in Appendix lAl that AdapTSP (and by 
Lemma [21 Isolation as well) is as hard to approximate as the group Steiner problem, and hence any 
improvement here will make progress on that problem too. 

4 Solving Group Steiner Variants useful for Isolation 

The input to the well-studied group Steiner tree problem (GST) consists of metric (V,<i) with root r G V 
and groups {X,- C V}f^j; the goal is to compute a minimum length r-tour covering all group, where a 
group / G [g] is said to be covered if any vertex from Xj is visited by the tour. The best approximation 
guarantee known for GST is Oilo^n- logg) lfT4l . and there is an ^(log^^'^n) hardness of approxima- 
tion |[20l . Next, we study two variants of group Steiner tree that are useful in obtaining an approximation 
algorithm for Isolation. 
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• In the latency group Steiner tree problem, in addition to the GST input we are also given non- 
negative weights for each group, and the objective is to compute an r-tour covering all groups that 
minimizes the sum of the weighted arrival times, where the arrival time at a group is the distance 
along the tour from the root r to the first vertex in that group. 

• In the group Steiner orienteering problem, in addition to the GST input there are profits on groups 
and a length bound B; the goal is to compute an r-tour of length at most B covering maximum 
possible profit. 

It is somewhat intuitive that the latency group Steiner problem is related to Isolation: in order to isolate 
the scenarios we need to hit at least one vertex from at least g — \ scenarios, and we need to do this with 
small expected latency. However, we need an adaptive (rather than a lineaiO strategy, and we need to 
isolate scenarios rather than just hit them; nonetheless, this problem will be useful as a subroutine, and 
we show the reduction in Section 13.11 It is easy to show (following |[2j |6l [Til) that an approximation 
algorithm for the orienteering objective leads to an approximation for the latency objective. Hence 
we first give an algorithm for group Steiner orienteering. Then we show how this algorithm for group 
Steiner orienteering implies an algorithm for a generalization of latency group Steiner (namely LPGST), 
that is particularly convenient in the algorithm for Isolation. 

4.1 Group Steiner Orienteering 

The group Steiner orienteering (GSO) problem takes as input, a metric iy,d) with root r G V, g groups 
of vertices {X, C VYi=\ with associated profits {(/>;}f^i, and a length bound B. The goal is to compute an 
r-tour of length at most B that collects the maximum possible profit (from the groups that are visited by 
the tour). 

In this section, we show that the deterministic algorithm for group Steiner tree ||5l can be used within 
a standard greedy framework to obtain a tour slightly longer than the length bound B, and which gets a 
constant fraction of the profit obtained by the best length-B tour. In particular, an algorithm for GSO is 
said to be an (fl',Zj)-bicriteria approximation if on any instance of the problem, it outputs a solution of 
length at most b ■ B that obtains at least ^ times the best profit obtained by any r-tour of length B. 

Theorem 11 There is a (4, 0{log^ n))-bicriteria approximation algorithm for GSO, where n is the num- 
ber of vertices in the metric. 

Proof: Let Opt denote the optimal profit of the given GSO instance. We first preprocess the metric to 
only include vertices within distance B/2 of the root r: note that since the optimal tour cannot visit any 
excluded vertex, the optimal profit remains unchanged by this. Note that now the profit achieved by 
visiting any remaining vertex is at most Opt. Also every group is covered by some vertex (otherwise the 
group can be dropped), i.e. the sum of profits achieved by visiting all vertices is at least ^f^j 0,-. Hence 
there is some vertex with profit at least ^ Yfi= \ 0(» we have Opt > ^ 1^1=1 0i- 

Our algorithm for GSO follows a standard greedy approach (see eg. Garg |[T5l ). and is given as 
Algorithm^ In the following, let a := O(log^n), where the precise constant comes from the analysis. 

We now prove that Algorithm [3] achieves a (4,4a + 2) bicriteria approximation. By the description 
of the algorithm, we iterate as long as the total length of edges in S is bounded by aB; since the increase 
in length of 5 in any iteration is at most (a + 1) • S (recall that every vertex in A is at distance at most B 
from the root r), the final length d{S) < {2a + 1) -S, and the Euler tour costs at most twice that. This 
proves the bound on the length; to prove Theorem \TT\ it now suffices to show that the final subgraph S 
gets profit at least Moreover, at any iteration, let p{S) denote the profit of S, and d{S) its length. 
Since d{S) > aB upon termination, it suffices to show the following invariant for subgraph S over the 
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Algorithm 3 Algorithm for GSO. 



1: initialize solution 5^0. Mark all groups as uncovered. 
2: while length of S does not exceed a -B do 
3: set residual profits: 

~ Jo for each covered group / G [g] 

' \ 0,- for each uncovered group / G [g] 

4: solve the following LP for the residual GSO to obtain solution {x,y): 

max 

s.t. x{5{S))>yi ySCV -.r^S, XiOS; yie[g] 

yi < 1 V/ G [g] 

Hede-Xe <B 

x,y>0 



As described in lO (using probabilistic tree embedding), obtain a spanning tree T in metric {V,d) 
and capacities xt on edges of T such that: T^eeT^e-xrie) < 0{logn)-Y,e<^e-Xe < 0(log?i) -B, and 
these capacities xj support at least y, flow from the root r to the group X, for each / G [g]. 
round down each xjie) to an integral multiple of ^. 

For each group / G [g], let y- be the maximum flow value from r to the group X, under these 
capacities xj, and let f[{e) (for all e G T) denote the flow variables realizing this flow. 
Using xj and ffs, run the deterministic algorithm for 'density group Steiner' (||5] Section 3.2]) 
to obtain subtree A rooted at r that covers groups A such that: 

^^"^^ < a ^3 — , where d{A) := J^eeA^e and a = 0(log^n) 



9: if(/(A) < aBthen5'^5UA. 

10: if d{A) > aB then: (i) partition tree A into at most 2 • subtrees, each of length at most aB; 

let A' denote the subtree containing maximum profit, (ii) Set S' SUA' U {/} where / is the 

shortest edge from r to A'. 
11: Set S <r- S'. Mark all groups visited in S as covered. 
12: end while 

13: output an Euler tour of the subgraph S. 



iterations of the algorithm: 

At the beginning, inequality (14.41 ) holds trivially since d{S) = for 5 = 0. Consider any iteration 
where p{S) < Opt/4 at the beginning (otherwise the claim is trivial); the invariant now ensures that 
d{S) < aB/2 and hence we proceed further with the steps of the algorithm. Moreover, the optimal value 
of the LP solved in this iteration 0, ■ yi > Opt — p{S) > ^ ■ Opt. After the pruning step (that rounds 
down capacities) we reduce the capacity of each edge in T by at most ^ . Since any cut in the tree T 
has at most n edges, the capacity of any cut decreases by at most ^ after Step[6l and by the max-flow 
min-cut theorem, y- > y,- — ^ for each / G [g] (in Step|7]l. Furthermore, since edge capacities are integer 
multiples of p-, so are all the flow variables f[ corresponding to a maximum r-Xj flow for all / G [g]. 
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(This latter condition seems mysterious but is needed for technical reasons by the algorithm in [3 which 
we use in Step [Hi. Also note that this rounding down does not change the fractional profits much, since 

t^i-y'i>t^i-yi--,t^i>l-Opt-^^t^>>l-Opt-^>^ (4.5) 

where the second last inequality follows from ^Lf=i 0; ^ Opt (by the preprocessing). By the properties 
of the algorithm of Charikar et al. S, in Step ^ we get a subtree A with d{A)/p{A) < 2a^. We 
finish by handling the two possible cases (Steps l9l and [TOl): 

. If d{A) < aB, then p{S') = p{S) +p{A) > ^ • d{S) + ^ . = ^ • d{S'). 

• \fd{A) > aB, then by averaging, we have p{A') > j^Pi^) > so piS') > 

In either case the subgraph S' satisfies inequality (14.41) . and since 5 ^ 5" at the end of the iteration, the 
invariant holds for the subgraph S as well. ■ 
This shows that we have a deterministic algorithm for GSO that outputs a tour whose length violates 
the budget B by a factor of a = 0{log^n), but which achieves a constant fraction of the best profit 
possible. In the next section, we will use it to get an algorithm for a version of the min-latency group 
Steiner problem. 



4.2 Partial Latency Group Steiner 

In the partial latency group Steiner (LPGST) problem, we are given a metric (V^d), g groups of vertices 
{Xi C V}f^j with associated weights {vvijf^j, root r £V, and a target h < g. A group / G [g] is said to be 
covered (or visited) by r-tour T if any vertex in X; is visited, and the arrival time of such a group / is the 
length of the shortest prefix of T that contains an X,-vertex. The anival times of all uncovered groups are 
set to be the tour-length. The weighted sum of arrival times of all groups is termed latency of the tour, 
i.e., 

latency (t) = I,- covered Wi ■ arrival timer{Xi) + 1,- uncovered Wi ■ length(T). (4.6) 

The objective in LPGST is to compute a minimum latency r-tour that covers at least h groups. (Note 
that setting the target h to g would give us the standai^d latency group Steiner problem.) In this section, 
we present an (0(log^«),4)-bicriteria approximation algorithm for LPGST, i.e., the solution tour visits 
at least ^ groups and has latency at most C?(log^ n) times the optimal latency of a tour that visits h groups. 

In the following discussion, fix any optimal tour i^* for the given instance of LPGST: let Lat* 
denote the latency and Opt* the length of (^*. Let j3 := |, and p = 0{log^n) be such that the algorithm 
of Theorem [TT]is a (4,p) bicriteria approximation for GSO. Algorithmic is the approximation algorithm 
for LPGST. 

Claim 12 The tour T in Step\8\has length 0(p) • Opt* and latency 0{p) • Lat*. 

Proof: The length of each T^'^ is at most p • j3' (by the guarantees of Theorem fTTl) . and hence the 
length of T is at most pY,i<i^' = 0{p)^^^^ = C?(p)Opt*. Moreover, the increase in Step [8] ensures that 
d{l) > p • Opt*. (This increase of the tour length is important, since the contribution of any uncovered 
group to the latency of the tour T is the length of T. Hence, if the original length of T was much smaller 
than p • Opt*, we want to ensure that uncovered groups still pay p • Opt* in the following analysis.) 

The proof for bounding the latency is identical to the analysis for the minimum latency/traveling 
repairman problem ||6l[TTl. We include it here for completeness. Recall optimal solution i^* to the 
LPGST instance, where d{^*) = Opt* G (/3'"\j3']. For each / G [/], let N* denote the total weight 
of groups visited in ^* by time j8'; note that A'^^* equals the total weight of the groups covered by ^*. 
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Algorithm 4 Algorithm for LPGST 



1: guess an integer I such that j8'^' < length* < jS'. 
2: mark all groups as uncovered. 
3: for / = 1 . . . / do 

4: run the GSO algorithm (Theorem fTTI) on the instance with groups {X,}f^i, root r, length bound 
j8'+^ and profits: 

Jo for each covered group / G [g] 

^' ' \ Wj for each uncovered group / G [g] 

5: let T^') denote the r-tour obtained above. 
6: mark all groups visited by t''' as covered. 
7: end for 

8: construct tour t <r- t^^^ o t^^^ o • • • o t^'', the concatenation of the r-tours found in the above itera- 
tions. If d{T) < p ■ 15^ then then increase its length to p • jS' (this may only increase the latencies of 
groups). 

9: run the GSO algorithm on the instance with groups {Xi jf^i, root r, length bound j8', and unit profit 

for each group. Let a denote the resulting r-tour. 
10: output tour 71 := T o a as solution to the LPGST instance. 



Similarly, for each / G [/], let A^,- denote the total weight of groups visited in T^'' • • • T^'^ (i.e. by iteration 
/ of the algorithm). Set Nq = := 0, and W := ^f^j w, the total weight of all groups. The latency of 
tour T is upper bounded by T := ^grj L;=oiS'^^ • {W —Ni). Note also that the latency of optimal tour ^* 

is Lat*>^lUi8'(W-A^;). 

Consider any iteration / G [/] of the algorithm in Step|4] Note that the optimal value of the GSO 
instance solved in this iteration is at least Nj — A^,_i: the j3' length prefix of tour ^* corresponds to a 
feasible solution to this GSO instance. Theorem [TT] implies that the profit obtained in t''', i.e. A'^, — 
^,■-1 > i • - A^,-i), i.e. W-Ni<l- (W-Ni^i) + ^ • (W-N*). Using this, 

" (=0 

< iS2-w + i£j8'+2(w-iv;) + |£^'+2(w-A^,-_0 

< 0(l)-Lat* + ^£^'+2(W-A^;) 

^ i=0 

< 0(l)-Lat* + ^-(i3-l)- 

4 p 

Since j3 = |, this implies T = 0{p) ■ Lat*, giving the claim. ■ 

Claim 13 The tour o in Step^covers at least ^ groups and has length 0{p) ■ Opt*. 

Proof: Since we know that the optimal tour l^* has length at most j8' and covers at least h groups, it is 
a feasible solution to the GSO instance defined in Step|9] Hence the guarantees on the GSO algorithm 
from TheoremfTTlensure that the tour a has length at most pj3' = C?(p)Opt*. Moreover, the fact that 
we set unit profits for each group ensure that the solution returned visits at least h/4 groups. ■ 

Theorem 14 Tour n = T ■ o covers at least ^ groups and has latency 0{p) ■ Lat*. 
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Proof: Since n visits all the vertices in a (and maybe some more), Claim [13] implies that TT covers at 
least I groups. For each group / € [g], let a,- denote its arrival time under the tour T obtained in Step [8] — 
recall that the arrival time a,- for any group / that is not covered by T is set to the length of the tour d{T). 
Claim [T2] implies that the latency of tour T, ^f^j w, • a,- = 0{p) • Lat*. Observe that for each group / that 
is covered in T, its anival time under tour 7i = z ■ G remains a,-. For any group j not covered in T, its 
arrival time under T and under tt = T • a are both 0(p) • Opt*: since length of ;r is 0(1) times length of 
T (here we use the increase in length of T from Claim[T2l). Hence, the arrival time under n of each group 
/ G [g] is 0(1) • a,-, i.e., at most a constant factor more than its anival time in T. Now using Claim [12] 
completes the proof. ■ 

Proof of Theorem [4) By Theorem [T4] above, the tour n covers h/4 groups, and has latency 0{p) • Lat*. 
But p = C?(log^?i) by Theorem [TT] which completes the proof of Theorem[4] ■ 

Remark: The above arguments also give an 0(log^ ?i)-approximation algorithm for the minimum latency 
group Steiner problem, which is the same as the LPGST problem when the target h = g. Note that 
the objective is to minimize the sum of weighted arrival times where every group has to be visited. 
Indeed, the algorithm is simpler, since for this problem we can just repeat Step [4] until all groups are 
covered, instead of stopping after / iterations. A proof identical to that in Claim[T2]iniplies an 0(log^?i) 
approximation. We claim this is the best possible unless we improve the known approximation for the 
standard group Steiner problem, which is a long-standing open problem. Indeed, it is shown in f28l that 
an a-approximation to latency group Steiner implies an 0{a • log^)-approximation to group Steiner 
tree; thus improving this 0(log^ n)-approximation for latency group Steiner would also improve the best 
known bound for the standard group Steiner tree problem. 



5 Optimal Decision Tree Problem 

In the optimal decision tree problem |[24] [T][3l. we ai^e given a set of m diseases with associated proba- 
bilities {pi}1Li that sum to 1, and a collection of n binary tests with non-negative costs {cj}"^^. 
Each test j G [n] is a subset Tj of the diseases that correspond to passing the test; so performing test j 
distinguishes between diseases Tj and [m] \ Tj. 

Definition 4 A test strategy S is a binary tree where each internal node is labeled by a test, and each 
leaf node is labeled by a disease such that: 

• For each disease i G [m\ define a path Ki in Sfrom the root node to some leaf as follows. At any 
internal node, if i passes the test then Hi follows the right branch; if it fails the test then Hi follows 
the left branch. 

• For each i G [m], the path Kj ends at a leaf labeled disease i. 

The cost Li of a disease i G [m] is the sum of test-costs along path Ki; and the cost of the test strategy S is 
Y!iLi Pi ■ Li. The objective in the optimal decision tree problem is to compute a test strategy of minimum 
cost. 

Observe that the optimal decision tree problem is a special case of Isolation: given an instance of optimal 
decision tree (as above), consider a metric {V,d) induced by a weighted star with center r and n leaves 
corresponding to the tests. For each j G [n], we set d{r,j) = y. The demand scenarios are as follows: 
for each / G [m], scenario / is {j G [n] \ i G Tj}. It is easy to see that this Isolation instance con^esponds 
exactly to the optimal decision tree instance. Based on this, it suffices to focus on Isolation in weighted 
star-metrics, and we obtain an improved bound as follows. 
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Theorem 15 Group Steiner orienteering on weighted star metrics has a (1 — ^) approximation algo- 
rithm. 

Proof: Assume, without loss of generality, that the root r is the center of the star. Consider an instance 
of group Steiner orienteering (GSO) on weighted stai^-metric (V,<i) with center r and leaves [n], g groups 
{Xi C [?i]}f^i with profits {v,}f^j , and length bound B. If for each j G [n], define set Sj := {/ G [g] \ j G X,} 
of cost Cj := ^^k^, then solving the GSO instance is the same as computing a collection K C [n] of the 
sets with Y.jeK'^j < 5/2 that maximizes f{K) := ^{v, | / G VJj^KSj}- But the latter problem is an instance 
of monotone submodular function maximization over a knapsack constraint (Lye/fCy <B/2), for which 
a 1 — ^ approximation algorithm is known |[32ll . ■ 

Since the proofs in Section |42] use the results for GSO as a black-box, we also get 

Corollary 16 There is an (0(1), 0(1)) bicriteria approximation for LPGST on weighted star metrics. 

Setting Plpgst = 0{\) (from Corollary [T6l) in the analysis of Section [3^ we obtain an C?(Plpgst • 
logm) = C?(logm) approximation for Isolation on weighted star-metrics. 

Theorem 17 There is an 0{logm) -approximation algorithm for the optimal decision tree problem. 

Multiway tests. Chakravarthy et al. fS\ considered the optimal decision tree problem when the outcomes 
of tests are multiway (not just binary), and gave an (log m) -approximation under unit probabilities 
and costs. We observe that our algorithm can be easily extended to this problem with non-uniform 
probabilities and costs. In this setting (when each test has at most / outcomes), any test j G [n] induces 
a partition {7y^}[^i of [m], and performing test j determines which of the parts the realized disease lies 
in. Firstly note that this problem is also a special case of Isolation. As before consider a metric {y,d) 
induced by a weighted star with center r and n leaves corresponding to the tests. For each j G [n] , we set 
d{r,j) = ^. Additionally for each j G [n], introduce / copies of test- vertex j, labeled (7, 1), • • • , {j,l), at 
zero distance from each other. The demand scenarios are defined naturally: for each / G [m], scenario / 
is {{j,k) I / G Tj^}. Clearly this Isolation instance is equivalent to the (multiway) decision tree instance. 
Since the resulting metric is still a weighted star (we only made vertex copies). Corollary [16] and the 
algorithm of Section [l!2l imply an C?(logm)-approximation for this multiway decision tree problem. 

6 Adaptive Traveling Repairman 

We now extend the results on AdapTSP and Isolation to the adaptive traveling repairman problem 
(which is the latency version of adaptive TSP). The input to AdapTRP is the same as AdapTSP and 
Isolation: a metric {V,d) with root r G V, a distribution ^ supported on m distinct subsets {5,}"^^ with 
respective probabilities {pi}"L^. 

Definition 5 (The AdapTRP Problem) Given metric (V, d), root r and demand distribution the goal 
in AdapTRP is to compute a decision tree T in metric {V,d) with the added conditions that: 

• the root of T is labeled with the root vertex r, and 

• for each scenario i G [m], the path P^^ followed on input Si contains all vertices in Si. 

The objective function is to minimize the expected latency Y^iLi Pi ' Lat(Ps,), where Lat(fs.) is the sum of 
arrival times at vertices Si along path Ps-. 

The main difference while solving the latency problem is that we can not just isolate the realized 
scenario and then visit the vertices of the realized scenario. This is because all the unvisited vertices 
of the scenario have to pay the entire duration of isolation, and a large group isolated very late would 
incur a high latency objective. For example, consider an instance of AdapTRP (and Isolation) defined 
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on a stai^-metric with center r and leaves {v, mi , • • • , u„}. Edges (r, Ui) have unit length (for each / G [n]), 
and edge (r,v) has length ^/n. There are ni = n + 1 scenarios: scenario = {v} occurs with 1 — ^ 
probability, and for each / G [n], scenario Si = {v, m,} occurs with ^ probability. The optimal Isolation 
solution clearly does not visit vertex v (it appears in all scenarios, hence provides no information). So if 
we first follow this strategy for Isolation, the arrival time for v is at least n (under scenario Sq); since 5o 
occurs with 1 — o(l) probability, the resulting expected latency is D.{n). However, the strategy that first 
visits V, and then vertices • • • ,u„} has expected latency 0{^^\ On the other hand, it is easy to see 
that one can not ignore the Isolation aspect of the AdapTRP either. 

Hence, in general we need to interleave the two goals of isolating a scenario and visiting more 
vertices to prevent the latency from blowing up. Like in the AdapTSP's isolation algorithm, the idea 
is to recursively create sub-instances of smaller size to isolate the realized scenario, while also visiting 
other vertices. For the recursion step, we again use a partitioning algorithm like in the isolation problem; 
however it is more involved in this case since we need to simultaneously isolate scenarios and visit high- 
probability vertices. We have two classes of vertices: Su are those which occur in more than half 
the scenarios, and 5^ are those which occur in fewer than half scenarios. The partitioning scheme is 
presented in Algorithm |5] 

Algorithm 5 PartnLat( {M,{q\)i^M) ) 
1: define F,, := {/ g M | v G 5,} for each v G V. 
2: let S ■= [JieMSi ^V,Sl:={u£S \ |f„| < ^}, and Sh := 

3: create an instance of the latency group Steiner problem with the following groups and weights: 
for each scenario S,- (with / G M), 

- The main group of scenario /, group Xj has weight n Si\pi and vertices [Si n Si) U {Sh \ Si). 

- For each v G 5; n 5//, group ¥>' has weight pi and vertices {v} U {Si n 5;) U {Sh \Si). 

4: run the latency group Steiner algorithm (Subsection l4.2l) on metric iy,d) with root r and groups as 
defined above. Note that the goal here is to cover all the groups, 
let t' := (r, vi , V2 , • • • , V,/ , r) be the r-tour returned. 
5: define a partition of M as follows: 

- scenario / G M is said to be covered by Vk if Vk is the first X,- vertex on t'. 

- let V;_i be the first vertex on t' after which /ewer than half the scenarios are uncovered. 

- forl<^<f — 1, part /\ comprises of all scenarios covered by V]^; and the last part Pf consists 
of all uncovered scenarios after Vt-\. 

6: return tour T = r,vi,V2, •• • ,Vf_i,r and the partition {PkYk=\- 



Given this partitioning scheme. Algorithm [6] for adaptive TRP is very similar to the one for isolation 
in AdapTSP — run the partition process and traverse that tour till (i) we visit a vertex in Si which 
appears in the current instantiation, f ii) we visit a vertex in Sh which does not appear, or ( in) we have 
"thrown away" half the scenarios. In each case, we can be sure that more than half the scenarios have 
been discai^ded, and we can then return to the root and recurse (while also getting rid of vertices already 
visited from the scenarios). 

6.1 The Analysis 

The analysis for this algorithm is similar to that for the isolation problem for TSP (Section 1X21 ) and we 
follow the same outline. For any sub-instance ^ of AdapTRP, let Opt(^) denote its optimal value. 
Just as in the isolation case (Claim [8]l, it can be easily seen that the latency objective function is also 
sub-additive. 
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Algorithm 6 AdapTRP(M, {qi}ieM) 
1: If \M\ = 1, visit the vertices in this scenario in (approximately) latency minimizing fashion, and 
quit. 

2: run PartnLat(M,{^;};eM) 

let T = (r, vi , V2, • • • ; Vf_i , r) be the r-tour and be the partition of M returned. 

3: let q'j := Lgp^ <?,• for all j G 1 . . . 

4: traverse tour t and return directly to r after visiting the first (if any) vertex V)t* (for k* ^ [t — I]) that 

determines that the realized scenario is in P^* C M. If there is no such vertex, set k* ■(^ t. 
5: Update the scenarios in /\* by removing vertices we have already visited in T until Vk*- 
6: run AdapTRP(/\* , {-x-},ePj, ) to recursively cover the realized scenario within 



Claim 18 For any sub-instance {M,{qi}i^M) and any partition {Pk}[^i ofM, 

lUi^^Opt((A,{|},-eP,)) < Opt((M,te},eM)), (6.7) 

where q'^ = LiePt 9/ all I <k <t. 

The next property we show is that the latency cost of the group Steiner instance considered in Step|4] 
of Algorithm [5]is not too high. In what follows, let ^ denote the latency group Steiner instance consid- 
ered. 

Lemma 19 For any instance ^ = {M,{qi}i^M) o/ AdapTRP, the optimal value of the latency group 
Steiner instance considered in Step^of Algorithm PartnLat(^) is at most Opt(^). 

Proof: Let T be an optimal decision tree for the given AdapTRP instance Note that any internal 
node of T, labeled v, has two children corresponding to the realized scenario being in Fy (yes child) or 
M\Fy (no child). Now consider the root-leaf path in T (and coiTcsponding tour a in the metric) which 
starts at r, and at any internal node v, moves on to the no child if v G Sl, and moves to the yes child if 
V £ Sh- We claim that this tour is a feasible solution to the latency group Steiner instance created. 

To see why, first consider any scenario / G M that branched off from path a in decision-tree T; 
let V be the vertex where the tree path to the scenario branched off from a. If v G 5l, then by the 
way we defined a, it must be that v belongs to 5,- n Sl, because otherwise the tree path to the scenario 
would not branch off a at v. On the other hand, if v G S//, then it must be that v £ Sh\ Si (again from 
the way a was defined). In either case, v G (S, n Sl) U {Sh \ Sj), and visiting v covers all groups (i.e. 
Xi, {yy I V G SiHSH}) associated with scenario /. Thus a covers all groups of the scenarios that branched 
off it inr. 

Note that there is exactly one scenario (say a G M) that does not branch off a; scenario a traverses a 
in T. Since T is a feasible solution for AdapTRP, a must visit every vertex in Sa- Therefore a covers all 
the groups associated with scenario a: clearly {yj | v G SaHSn} are covered; Xa is also covered unless 
Sar\SL = ® (however in that case group X^, has zero weight). 

We now bound the latency cost of tour a for instance Let a, (for each / G M) denote the coverage 
time for group X/, and py (for / G M and v G SiHSn) the coverage time for group Vy. The next claim 
shows that the latency of a for instance is at most Opt(^). 

Claim 20 The expected cost of the optimal solution for the AdapTRP instance ^ is at least Y^i^m Pi ' 
\SL^Si\ ■ (Xj + J^it^M Pi ' Pi> which is the average latency of tour o for the latency group Steiner instance 

Proof: Fix any i £ M; let a,- denote the shortest prefix of a containing a vertex from Xj (note that by 
definition a,- has length a,). Since all but the last vertex in a,- are from (5l \5;) U (5// n5,), by definition of 
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(7, the path traced in the decision-tree T when scenario / is realized agrees with a, . Hence under scenario 
Si, the total arrival time (i.e. latency) for vertices Sir\Si is at least jS'LnS'il • a,-. So the contribution of Si 
towards the expected total latency in Opt(^) is at least pi • j^/, n 5,1 • a,. 

Now consider some vertex v S 5, H 5//; let oj denote the shortest prefix of a containing an l^-^'- vertex 
(note has length j8/'). Again by the definition of a, when scenario / is realized, vertex v is not visited 
before tracing path oj in the decision tree T . So the contribution of v (under scenario /) to expected total 
latency is at least pi • j3/'. ■ 
Thus we have demonstrated a feasible solution to of (weighted) latency at most Opt(^). ■ 

It remains to bound the expected additional latency incurred in Step|4]of Algorithm [6] when a random 
scenario is realized. 

Lemma 21 At the end o/5'tep|?]o/ AdapTRP(M,{^,},g^/), the realized scenario lies in /\* (as deter- 
mined in Step^. The expected additional latency incurred in this step is at most C?(log^ n) ■ Opt( (M, {qi}ieM 

Proof: Consider a particular scenario / G M. If this is the scenario that has realized, the algorithm would 
traverse tour T until it hits the vertex Vk such that / G Pk*. By the definition of the partition algorithm, 
this is the first point in T where we visit a vertex in [Si n Sl) U {Sh \ Si). We show below that the 
expected additional latency incurred by vertices in Si in Step |4] of Algorithm [6] is at most twice the total 
(weighted average) latency of groups | v G 5; n 5//} in solution T to the latency group Steiner 

instance W. To see this, observe that any vertex visited along the way before would incur only the 
length till it was visited (correspondingly, the group F/' would incur exactly the same latency in tour the 
solution for instance All other vertices incur a latency of the entire prefix until Vyt: this is exactly 
the latency incurred by the groups X, and Y-' (for unvisited vertices v) in the tour a for the latency group 
Steiner instance Furthermore, the unvisited vertices would incur an additional latency in this step, 
since we go back to the root before recursing. Therefore, the total expected latency accrued in this 
step is at most twice the latency of T w.r.t the group Steiner instance W. The proof then follows from 
Lemma[l9l and because the latency group Steiner problem admits an O(log^n) approximation algorithm 
(see Subsection l42l ). ■ 
Since in each level of the recursion we halve the number of possible scenarios, the depth of this 
is at most log|M|. This combined with Lemma |2T] and Claim [T8l gives us the following result (by an 
identical proof as in Theorem |9ll. 

Theorem 22 There is an 0{log^n ■ \ogm) -approximation algorithm for AdapTRP. 

There is still a logarithmic gap between our approximation bound, and the Q.{\og^^^ n) hardness of 
approximation |[28l . for AdapTRP on tree-metrics. Closing this gap is an interesting open question. 
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A Hardness of Approximation for Ad a pTS P 

We show that AdapTSP is at least as hard to approximate as group Steiner tree. 

Theorem 23 An a-approximation algorithm for AdapTSP implies an a + o{\) approximation algo- 
rithm for group Steiner tree. Hence AdapTSP is Q(log^^^?i) hard to approximate even on tree metrics. 
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Proof: This reduction is almost identical to the reduction HI from Set-cover to the optimal decision tree 
problem; we give a proof in context of AdapTSP for completeness. 

Consider an arbitrary instance of group Steiner tree on metric {V, d) with root r and groups Xi,--- ,XgCl 
V; let Opt denote its optimal value. Assume without loss of generality that X,- 7^ Xj for all / 7^ j, and 
the minimum non-zero distance in d is one. We construct an instance of AdapTSP as follows. Let 
V' = VU {s} where 5 is a new vertex (representing a copy of r), and define metric d' on V' as: 



There are g + 1 scenarios in the AdapTSP instance: 5, := Xi U {s} for / G [g], and Sg+i := {s}, with 
probabilities 



Above L S> 2« • max„ ,,(i(M, v) is some large value. The root in the AdapTSP instance remains r. Let 
Opt' denote the optimal value time of this instance. We will show that Opt < Opt' < Opt + 1 which 
would prove the theorem. 

(A) Opt < Opt'. Consider the optimal strategy for the AdapTSP instance; let a denote the r-tour tra- 
versed by this strategy under scenario Sg+i. We now argue that a is a feasible solution to the group 
Steiner tree instance. Suppose for a conti'adiction that a does not visit any X,- vertex for some / G [g]. 
Then observe that the r-tour traversed by this strategy under scenario S, is also a, since the strategy 
can not distinguish scenarios Si and Sg+\ (the only way to do this is by visiting some X,-vertex). How- 
ever this violates the requirement that the tour (namely a) under scenario Si must visit all vertices Si 5 X;. 

(B) Opt' < Opt+ 1. Let T denote an optimal r-tour for the given GST instance, so d{x) = Opt. Consider 
the following strategy for AdapTSP: 

1. Traverse r-tour T to determine whether or not X^+i is the realized scenario. 

2. If no demands observed on T (i.e. scenario Sg^i is realized), visit vertex s and stop. 

3. If some demand observed on T (i.e. one of scenarios is realized), then visit all vertices in 
V along an arbitrary r-tour and stop. 

It is clear that this strategy is feasible for the AdapTSP instance. For any / G [g + 1], let 71,- denote the 
r-tour traversed under scenario 5, in the above strategy. We have ^(71^+1) < d{z) < Opt, and d{ni) < 
2n ■ maXj, ,,<i(M, v) < L for all / G [g]. Thus the resulting AdapTSP objective is at most: 




for u,v £V 
for M G V, v = s 






Thus we have the desired reduction. 
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